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Abstract. Some rigorous results and statistics of the solution space of Vertex-Covers 
on bipartite graphs are given in this paper. Based on the Konig's theorem, an exact 
solution space expression algorithm is proposed and statistical analysis of the nodes’ 
states is provided. The statistical results fit well with the algorithmic results until the 
emergence of the unfrozen core, which makes the fluctuation of statistical quantities 
and causes the replica symmetric breaking in the solutions. Besides, the entropy 
of bipartite Vertex-Cover solutions is calculated with the clustering entropy using a 
cycle simplification technique for the unfrozen core. Furthermore, as generalization of 
bipartite graphs, bipartite core graph is proposed, the solution space of which can also 
be easily determined; and based on these results, how to generate a Konig — Egervdry 
subgraph is studied by a growth process of adding edges. The investigation of solution 
space of bipartite graph Vertex-Cover provides intensive understanding and some 
insights on the solution space complexity, and will produce benefit for finding maximal 
Konig — Egervdry subgraphs, solving general graph Vertex-Cover and recognizing the 
intrinsic hardness of NP-complete problems. 
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1. introduction 

Minimum vertex cover (Vertex-Cover) problem, as one of Karp’s 21 NP-complete 
problems and a classical graph theoretical computational problem |T], has a central 
status in the research of computational complexity and attracts the interests of many 
mathematicians, physicists and computer scientists, which also has a large number of 
applications such as immunization strategies in networks [2], the prevention of denial- 
of-service attacks [3] and monitoring of internet traffic Hj. Till now, it is known that 
Vertex-Cover problem is NP-complete even in cubic graphs |5] and planar graphs |6]. 
But for bipartite graphs, Konig's theorem proves a fact that its minimum coverage is 
equal to its maximum matchings [7]. Besides, for graphs without leaf-removal core, a 
simple algorithm can help to find a minimal vertex-cover in polynomial time [8]. 

In statistical mechanics, there are a lot of results obtained on the research of 
Vertex-Cover problem. In 0 CQ], theoretical analysis of Vertex-Cover problem on 
random graphs is provided to investigate how to solve it efficiently. In a significant 
structure named long-range frustration, is proposed to explain the strong correlations 
among the nodes in Vertex-Cover, and in [12] and [13], the replica symmetric breaking 
(RSB) analysis on the minimum coverage and its entropy of random graphs is discussed. 
Recently, another relationship between the nodes, mutual-determination, is investigated 
to detect the solution space of Vertex-Cover ca, which can express the solution space 
exactly when the graph has no leaf-removal core. And, a solution number counting 
algorithm is given based on the solution space expression of Vertex-Cover [15]. In 
these research, most results can be proved right only when the graph is of simple 
structure/topology, such as trees and graphs with no leaf-removal core, which has to 
ensure the validity of replica symmetry assumption. Compared with the Vertex-Cover 
problem, the satisfiability problem especially k-SAT problem is known as another famous 
NP-complete problem [I6], but fc-XORSAT problem can be solved in polynomial time 
which undergoes both the replica symmetry and one step replica symmetric breaking 
phases (1-RSB) [17]. Can we find a similar easily-solving subset of Vertex-Cover like 
that fc-XORSAT in fc-SAT? 

Based on the Konig^s theorem [7] and the existing reduced solution graph analysis 
na, the complete solution space of Vertex-Cover on bipartite graphs will be studied in 
the paper, and a polynomial-time algorithm to obtain the exact solution space expression 
will be provided. By the results, statistical analysis of the nodes’ states in the solution 
space is done, which will lose its effectiveness when a structure named unfrozen core 
emerges. The existence of the unfrozen core accords with the replica symmetric breaking, 
as some nodes can have very large influence on others of the graph, which implicates the 
long-range correlation among nodes. Besides, the entropy and the clustering entropy 
of bipartite graph Vertex-Cover are investigated. As some generalizations, some easily- 
solving Vertex-Cover instances are constructed and how to generate a Kdnig — Egervdry 
subgraph is discussed. 
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2. Vertex-Cover solution space of bipartite graphs 

As is known to all, the classical Vertex-Cover problem is one of the six basic NP-complete 
problems |6] . For arbitrary graphs, till now there is no algorithm in polynomial time to 
hnd the minimal vertex-covers, and to detect its solution space is even harder. In this 
section, we will focus on minimal Vertex-Cover on bipartite graphs which belongs to the 
P class in computational complexity, and achieve some rigorous results on the solution 
space and its statistics. 

2.1. Rigorous solution space expression of Vertex-Cover on bipartite graphs 

In [H], the reduced solution graph is dehned to describe the solution space of Vertex- 
Cover: double edges are used to identify the mutual-determinations, in which two ends of 
the edge are unfrozen and can determine the states of each other; red or black nodes are 
used to identify the positive or negative backbones, which should be always uncovered 
or covered in the solution space of Vertex-Cover; single edges retained from the original 
graph keep the relationship among the nodes by Vertex-Cover. By the results in HI, 
the reduced solution graph can exactly express the whole solution space of Vertex-Cover 
when there is no leaf-removal core [8] on the given graph. And, in the following, this 
result can be generalized to bipartite graphs. 

Theorem: The solution space of Vertex-Cover for bipartite graphs with n nodes 
can be exactly represented by the reduced solution graph in 0{n?) steps, i.e., the states 
(unfrozen or backbone) of any node can be easily determined for bipartite graphs. 

proof : Our inference is based on a very important theorem in graph theory, the 
Konig^s theorem [^: In any bipartite graph, the number of edges in a maximum 
matching equals the number of vertices in a minimum vertex-cover. And, the maximum 
matching can be solved by an algorithm within polynomial time for general graphs |18j . 
especially for the bipartite graph it can be solved in O(n^) steps by the Hungarian 
algorithm [T9] . 

For a bipartite graph G, we can hrst get a maximum matching by the Hungarian 
algorithm, e.g., {ii,ji), (* 202 ), • • ■, {im,jm)- By the Konig^s theorem, if the maximum 
matching has m independent edges (edges without common ends), there should be m 
covered nodes in the minimal vertex-covers, which means that each independent edge 
must have one and only one covered node. Then, there should be one and only one 
covered node in each such pair of nodes {ik,jk), k = 1, • ■ • ,m, which suggests that the 
relation in each pair of matched nodes should be mutual-determinations (double edges) 
in the solution space of Vertex-Cover; and no other nodes except A, ji, ^ 2 ,^ 2 , • • ■, im,jm 
can be covered in the minimal vertex-covers, which suggests that all the other nodes 
except those matched have to be always uncovered (positive backbones) in the solution 
space of Vertex-Cover. 

For the obtained graph, if the positive backbones have neighbor nodes on the 
original graph G, these neighbors should have their states to be always covered 
(negative backbones); if any new produced negative backbone, say 4, has one mutual- 
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(a)The original bipartite graph 



(b)Find a maximum matching and 
mark them with double edges 



(c)Find the nodes not in the 
matching 



(d)Set the unmatched nodes always 
uncovered (red) 



(e)Fix nodes connecting the 
unmatched ones by Vertex-Cover 



(f)Final solution space fixed by 
freezing influence 


Figure 1. An instance for the process of obtaining the reduced solution graph of 
Vertex-Cover for bipartite graphs. 


determination {ik,jk) neighbor node jk, jk should be always uncovered (positive 
backbones) by the mutual-determination relation. This rule is followed by the 
requirements of Vertex-Cover and should be performed until no such conditions exist, 
which costs at most 0{n) steps. 

Bipartite Graph Solution Space Expression Algorithm 
INPUT: Bipartite graph G 

OUTPUT: Reduced solution graph S^{G) of G 
Freezing-Influence (Graph G, Matching 
begin 

S^{G) = make all matching edges double edges 
while (unmatched nodes exist) 

S^{G) = make all unmatched nodes uncovered backbones 
^{G) = make uncovered backbones' neighbors covered backbone 
while (any double edge (i,j) has node i covered backbone) 

^{G) = make j uncovered backbone 
=5^((7) = make j’s neighbors covered backbone 
Tetnicn{y{G)) 
end 
main 
begin 

^=maximum matching of G 
y'{G)=Freezing-Influence{G, 
end 

Then, the final obtained graph is the reduced solution graph of G. As the whole 
process is done according to the Konig^s theorem and the requirements of Vertex-Cover, 
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the reduced solution graph is an exact representation of the solution space of Vertex- 
Cover and the total time consumption is at most O(n^). □ 

According to the proof of the above theorem, a Bipartite Graph Solution Space 
Expression Algorithm is provided to obtain the reduced solution graph of a given 
bipartite graph G, and an instance for this process is shown in FiglU 


2.2. Statistical analysis of bipartite Vertex-Cover nodes’states 


Denote G = {Xi, X 2 , E) as a bipartite graph with two independent node sets Xi and 
X 2 , and the edge set E only involves edges connecting Xj and X 2 . Dehne rii, n 2 as the 
sizes of Xi and X 2 , m as the size of i?, ci = ^, C 2 = ^ as the average degrees in Xi 
and X 2 , n = ni -|- n 2 as the total nodes number, and the average degree for the whole 
graph is c = 2ciC2/(ci -1- C 2 ). Then, the probability p for the appearance of each edge 
satishes p = ^ ~ ^ ~ degree distributions of Xi and X 2 are separately 

k k 

Pi{k) = and P2{k) = when ni,n2 are sufficiently large 

m- 

Let Q be the size of giant connected component of the bipartite graph, Qi the 
node ratio in Xi and Q 2 for X 2 . Thus, one node in Xi belongs to the giant connected 
component, if and only if at least one of its neighbors in X 2 is in the giant connected 
component. So, we have 



Q, = 1 - 5^ Pi (A^) (1 - = 1 - 

k 

(1) 


Q 2 = 1 - {k) (1 - go" = 1 - 

(2) 

Thus, the 
Next, 

size of giant component for the whole graph G is g = 
we will calculate the coverage ratio of a random bipartite graph. 

Taking 


the analysis in IZH, dehne tti as the probability of an edge entering a node in Xi with 
coverage requirement, i.e., it has not been covered, and 712 is the same but with entering 
node in X 2 . Thus, an edge entering a node has coverage requirement only when the 
other end of the edge has no coverage requirement from the rest edges, and we have 


00 


^gi(fc0(l-vr2)"b 

ki=0 

00 

(3) 

5^g2(A:2)(l-vr0"0 

fc2=0 

(4) 


where qi{k) = {k l)pi{k + l)/ci = Pi{k), i = 1,2 are the probabilities of an edge 
entering a node with degree k 1 in Xj. Then, the above equations can be simplihed 
by 


-Ci7r2 


-C27ri 


TTi = e 


712 = e 


(5) 
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Using the analysis above, a node i in Xi should be uncovered only in the following two 
cases: all its neighbors have coverage requirements from their edges; all its neighbors 
except one (say j) have coverage requirements from their edges, and at this time nodes 
i and j can either be covered. The analysis is the same for nodes in X 2 , so we have the 
coverage ratios in Xi and X 2 as 


Xi = 1 - ^pi(/c)(l - TTs)^ - '^pi{k){l - TTs) 


c-i ^^2 


= 1 


A:=0 


-C17T2 


k=l 


Ci7T2 _ 


€1772 


( 6 ) 


3^2 = 1 - ^P2(fc)(l - TTi)^ - ^P2(fc)(l - VTi) 


k-Ml 


= 1 


k=0 


—C 27 ri 


k=l 


C2Vri _ 


C2TT1 


(7) 


Thus, the coverage ratios of the whole graph G is x = . 

Using similar analysis in mm, we derive the ratios of backbones and unfrozen 
nodes in the following. Let qf, qf, Qi, qf, Qf, Q 2 be the ratios of uncovered backbones, 
covered backbones and unfrozen nodes in Xi and X 2 separately, and evidently qf + qf -\- 
qi = l,q 2 + + ?2 = 7- Then, one new added node should be an uncovered backbone 

only when all its neighbors are unfrozen or covered backbones; one new added node 
should be an unfrozen node only when there is exactly one uncovered backbone in its 
neighbors. Thus, we have 


= e 

k=0 

00 

- qt)^ = (8) 

k=0 


qi = '^Pi{k)kqf{l - qf)’" ^ = Ciqfe 

k=l 

00 

(I 2 = '^P2{k)kqt{l - qt)^-^ = C2qt (9) 

k=l 

And, the ratios of the uncovered backbones and unfrozen nodes for the whole graph G 
are separately q~^ = '^1 *^^+12 *^1 gO _ gi*c 2 +g 2 *ci ^ |^pg coverage ratio of the 

whole graph G is a: = 1 — — g°/2, and it is easy to see that the results of coverage by 

equations (6-7) and equations (8-9) are actually the same. By the shape of equations 
(jS]) and dH]), the probability of an edge entering a node with coverage requirement is the 
same as the probability of a node being positive backbone. 

For Vertex-Cover of random graphs as we know, when there is a leaf-removal core, 
as the existence of long-range correlations [HI [m 122], we cannot obtain the exact results 
of covered and uncovered backbones, and the key difficulty is how to deal with the 
correlations among the neighbors of a node. Fortunately, the correlation among the 
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Statistical Results of Bipartite Graph for n ;n = 4:1 


Statistical Results of Bipartite Graph for = 4:2 
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Statistical Results of Bipartite Graph for n^:n 2 = 4:3 
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Figure 2. The results of minimal coverage, unfrozen nodes and negative backbones 
for different ratios of ni : n 2 = 4/1,4/2,4/3 with different average degrees. The 
theoretical results are shown by lines with different colors and the experimental results 
are by data with red symbols. The experimental results fit very well with those by 
equations (6-9). All the experimental results are obtained by 1000 instances with size 
ni -I- 712 = 2000. 


neighbors is easy to deal with on the bipartite graphs: if a new added node i has no 
neighbors of positive backbones, i being nncovered reqnires that all its neighbors change 
to be covered, and it will canse no confliction among these neighbors. Assnming two 
nnfrozen neighbors ji,j 2 of i cannot be covered simnltaneously, i.e., ji being covered 
causes an influence propagation to affect the state of j 2 and requires j 2 being uncovered, 
by the organization of the reduced solution graph, the number of nodes in the influence 
chain between ji and j 2 should be even, e.g., the intermediate nodes ki,k 2 , ■ ■ ■ ,k 2 n, 
and we can hnd that nodes i, ji, ki, /c 2 , • • •, k 2 n,j 2 form a cycle having 2n + 3 nodes, 
which conflicts with the organization of bipartite graph. Thus, we can always have that 
the unfrozen neighbors of a node can be covered simultaneously for bipartite graphs. 
Therefore, the long-range correlations of Vertex-Cover on bipartite graphs are easygoing 
and produce no trouble for determining nodes’ states by a node-adding process mm- 
The local evolution of the unfrozen node and negative backbone can be obtained by the 
same analysis. 

In Figj2l the results of minimal coverage, unfrozen nodes and negative backbones on 
random bipartite graphs are provided, which £t very well with the analysis of equations 
in this subsection. These statistical quantities evolve quite like those on random graphs 
P, but the minimal coverage becomes almost unchangeable after some average degrees: 
for Cl : C 2 = 4 : 1, the minimal coverage almost stays to be x = 0.2 when c >~ 3.3; 
for Cl : C 2 = 4 : 2, the minimal coverage almost stays to be x = | when c 5.5; for 
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Average Degree 


Figure 3. The results of minimal coverage, unfrozen nodes and negative backbones 
for ni : 712 = 1 with different average degrees, shown separately by blue, green and 
black lines/symbols. The experimental results fit well with those by equations (7-11) 
when c < e, but diverge when the average degree gets larger. All the experimental 
results are obtained by 1000 instances with size ni -|- 712 = 2000. 

Cl : C 2 = 4 : 3, the minimal coverage almost stays to be a; = | when c >~ 8.5. By the 
organization of bipartite graphs, it suggests that after some large average degrees, the 
minimal coverage, i.e., number of maximum matchings, should be fulhlled by the nodes 
in the independent set with smaller size; on the reduced solution graph, after some 
average degrees, almost all the nodes in the smaller independent set become negative 
backbones, and almost all the nodes in the other part become positive backbones. 

2.3. Fluctuation and unfrozen core of bipartite Vertex-Cover solution space 

In the above subsection, a statistical analysis for bipartite Vertex-Cover is provided with 
unequal sizes of two independent sets. When ci : C 2 = 1 : 1, the coverage obtained by 
these equations also hts very well with the experiments, but it is surprising that the 
results of equations (8-9) deviate from the experimental ones on calculating positive 
backbones and unfrozen nodes when the average degree c > e, which are shown in 
Figini In this subsection, a new concept, unfrozen core, will be introduced to explain 
this phenomenon. 

Unfrozen core, an organization on the reduced solution graph, is the leaf-removal 
core for the unfrozen nodes. In the unfrozen core, some unfrozen nodes couple together 
and have close connections, which suggests that there are strong correlations among 
them. When a new node is added with connection to the unfrozen core, it being 
uncovered will lead to a large number of nodes in the unfrozen core to be frozen. Thus, 
the unfrozen core is very vulnerable under the local evolution of the node states. 












Research on Solution Space of Bipartite Graph Vertex-Cover by Maximum Matchings 9 


Ave.Deg 

c = 0.5 

c = 1 

c= 1.5 

c = 2 

c = 2.5 

c = 3 

c = 3.5 

p{big ratio) 

1 

1 

1 

1 

1 

0.919 

0.930 

Ave.Deg 

c = 4 

c = 4.5 

c = 5 

c = 5.5 

c = 6 

c = 6.5 

c = 7 

p{big ratio) 

0.914 

0.877 

0.860 

0.796 

0.702 

0.636 

0.458 

Ave.Deg 

c = 7.5 

c = 8 

c = 8.5 

c = 9 

c = 9.5 

c= 10 

c= 10.5 

p{big ratio) 

0.380 

0.231 

0.164 

0.095 

0.068 

0.030 

0.021 

Ave.Deg 

c = 11 

c = 11.5 

c = 12 

c= 12.5 

c= 13 

c= 13.5 

c= 14 

p{big ratio) 

0.019 

0.005 

0.003 

0.001 

0.001 

0.000 

0.000 


Table 1. The sizes p of big ratio class of positive backbones for different average 
degrees. All the experimental results are obtained by 1000 instances with size 

ni + 71-2 = 2000. 
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Figure 4. The experimental results of positive backbones for different instances 
with ni : 712 = 1 and average degrees c = 2,6,10,14. The blue cycles provide the 
experimental results of 1000 instances for ratios of positive backbones, and the red line 
shows the average over the 1000 instances. All the instances have size rii + 712 = 2000 
here. 


In FigJU an interesting fluctuation phenomenon emerges: when the average degree 
is small (c = 2), the positive backbone ratios (the blue cycles) fluctuate around the 
average value (the red line) and the average is a typical value; when the average degree 
gets larger (c = 6), the positive backbone ratios split into two different classes, in which 
some are around 0.5 {big ratio) and the others are around 0 {small ratio)] when c = 10, 
this phenomenon gets clearer and the class of big ratio greatly reduces; when c = 14, 
the big ratio class disappears. In Table.I, the sizes p of the big ratio class are provided 
to show its evolution. 

In FigJSl the experimental results of sizes for giant component, leaf-removal core 
and unfrozen core on random bipartite graphs with ni : 77-2 = 1 : 1 are provided, in 
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Figure 5. The statistical results of giant connected component sizes, leaf-removal core 
sizes and unfrozen cores sizes for random bipartite graphs of ni : 712 = 1 with different 
average degrees. All the experimental results are obtained by 1000 instances with size 
ni n2 = 2000 . 


which the emergence of leaf-removal core and unfrozen core is simultaneously. By the 
analysis in mils], the emergence of the unfrozen core means the existence of long-range 
correlations. 

For any specific instance, the local evolution of the node states can be exact, which 
derives from the theorem of the reduced solution graph for bipartite graphs. Why the 
statistical results for positive backbones/unfrozen nodes in equations (8-9) lose their 
effect? For statistics in equations (8-9), there is a basic assumption that adding a new 
node cause little influence on the macroscopic statistical results. However, when the 
unfrozen core exists and a new node is added, if the new node is connected to the 
unfrozen core, there is a large probability that many nodes’ states will be changed 
to be frozen in the core, which is shown in FigJU by the existence of big ratio class 
of positive backbones. Indeed, by the results in FigJU and FigJS] when c > e, some 
instances have unfrozen core, but the others have no unfrozen core as many nodes in 
the core are forced to be frozen by some few nodes and the core is broken. So, one 
single node in microscopic scale, the influence of which should be neglected in statistics, 
may have profound influence on the statistical results in macroscopic scale when the 
unfrozen core exists. Thus, the unfrozen core can survive after the node-adding process 
for some instances, and for other instances, many of the unfrozen cores get damaged 
or even disappear, which are determined by the specific details of the organizations 
of bipartite graphs. Therefore, the statistical results by equations (8-9) for the node 
states are unstable and not exact when the unfrozen core exists after c > e. Similar 
as the analysis in im, the existence of the unfrozen core is quite like the leaf-removal 
core in fc-XORSAT, under which the replica symmetric breaking works. The long-range 
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correlation among nodes in the unfrozen core can also be reflected in the core entropy 
calculated in next section. 

3. Counting the solution number of bipartite Vertex-Cover 

By the above analysis, we can see that there is no odd-cycle-breaking [E] in obtaining 
the reduced solution graph of bipartite Vertex-Cover, and reduced solution graph for 
any bipartite graph can be exactly achieved. To see the organization complexity of the 
solution space of Vertex-Cover on bipartite graphs, the entropy hs = of whole 

solution space and core entropy he = are studied in this section, in which 

Sn, Sc are separately the total number of minimal vertex-covers and just the number of 
minimal vertex-covers in the unfrozen core. 

For an obtained reduced solution graph (neglecting the backbones), to know the 
core entropy, the leaf-removal unfrozen core should be achieved hrst, which is shown 
in Figj5]with instances in Figj6] (a-c). However, by the results in FigJSl not all nodes 
in the leaf-removal core belong to the unfrozen core, the unfrozen core excludes the 
backbones and performs as the intrinsic complexity, and the nodes in the unfrozen 
core have strong correlations and long-range influence [15] . By the knowledge of Vertex- 
Cover on random graph cases piinKii], solution space (ground states in the terminology 
of statistical mechanics) undergoes the replica symmetric breaking phenomenon when 
the leaf-removal core exists at the average degree c > e, and it should have a replica 
symmetry phase when there is no leaf-removal core. So, for the strong correlations 
among the nodes in the unfrozen core, we can recognize that the clustering entropy [16] 
is caused by the minimal vertex-covers in the unfrozen core, i.e., each proper assignment 
in the unfrozen core leads a cluster of solutions on the whole reduced solution graph, 
and the core entropy is indeed the clustering entropy. 

To calculate the core entropy he, we indeed calculate the number of minimal vertex- 
covers in the unfrozen core of the reduced solution graph. Here, a technique named cycle 
simplification can be performed: when there is some cycle with alternatively double and 
single edges in the unfrozen core, the nodes on the cycle mutually determine each other, 
and they can be simplihed by two nodes. In Fig|6](a-b), for the six nodes in the unfrozen 
core, it is easy to verify that the yellow nodes must take the same value in the 

solution space and so do the green nodes ji, j 2 , js, and in Fig]6](c) one yellow node i 
and one green node j are used to substitute these two classes of nodes, in which nodes 
outside the unfrozen core connect i when it originally connects ik,k = 1, 2, 3 or j when 
it originally connects jk, k = 1,2, 3. It is evident that the cycle simpliheation does not 
change the solution number but leads to new reduced solution graph with smaller size. 
Then, for the simplihed core, the Vertex-Cover Solution Number Counting Algorithm 
[T5] can be used to calculate the clustering entropy he- Then, a result for the unfrozen 
core of the reduced solution graph after cycle simpliheation will be given: 

Proposition: After the cycle simpliheation operations for N^{G), the new reduced 
solution graph N^'{G) has no unfrozen core, i.e., all the unfrozen nodes can be removed 
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(a)Reduced solution graph 



{b)Leaf-removal core 


(c)Cycle simplification 



Figure 6. Instances for leaf-removal core and cycle simplification of bipartite Vertex- 
Cover reduced solution graph. Subgraph (a) is the reduced solution graph, and the 
green and yellow nodes must take the same values separately. Subgraph (b) provides 
the leaf-removal core of (a), and subgraph (c) shows the cycle simplification for (a), 
in which the yellow node and green node in (c) correspond to those with the same 
color in (a). Subgraph (d) gives a description of how to find an aimed cycle to do 
cycle simplification, and the green nodes and edges form the aimed cycle. Subgraph 
(e) provides an instance for a core simplification, in which the above one is the original 
unfrozen core and the below one is the simplified one. Colored nodes in (e) have the 
same meaning as those in (a) and (c). 


after the leaf-removal steps. 

proof : Here, we use proof by contradiction and assume that there can be leaf- 
removal unfrozen core Uc{G) after all possible cycle simplihcation operations. In the 
following, we will prove that there must be cycles satisfying the requirement of cycle 
simplihcation in Uc{G). For the unfrozen core Uc{G) of as the connectivity of 

all the nodes in it, a spanning tree T [23] containing all the double edges of Uc{G) can 
be easily obtained. For there is no leaf in Uc{G), any additional single link of Uc{G) —T 
must produce cycles. 

For a pendant R of one leaf as the end of one branch of T, there must be some 
edge Cl in Uc{G) — T with end G, which connects some other node in T and form a 
cycle with G, and then we consider TIJ ei. When the other end (except G) of edge ei is 
one of its ancestors in its branch, by the bipartite property, the resulted cycle must be 
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Experimental Results of the Unfrozen Core Entropy of Bipartite Vertex-Cover 
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Figure 7. The experimental results of entropy and core entropy of bipartite Vertex- 
Cover for random bipartite graphs of ni : 712 = 1 with different average degrees. All 
the experimental results are obtained by 1000 instances with size ni + n 2 = 2000. 


with even number of nodes and satisfy the requirement of cycle simplihcation, and the 
contradiction is found. When the other end of ei is a node in another branch of T, if the 
resulted cycle satisfies the requirement of cycle simplification, all the nodes including R 
should be simplified and the contradiction is found; otherwise consider a new connection 
62 of the pendant I 2 in the new branch. For TIJ ei IJ 62 , if ^2 connects one of its ancestors 
or some node in the branch of G by 62 , then by the bipartite property, the resulted 
cycle must satisfy the requirement of cycle simplification and the contradiction is found. 
Similar analysis as R can be performed on I 2 , and if no aimed cycle emerges, we can 
consider some I 3 , 63 and TIJ ei [J 62 IJ ^ 3 ; '' '• For this process, there must be some step 
that the new pendant R connects with some already considered branch (assuming its 
leaf is /j). The cycle consisting of h, e*, h+i, Ci+i, ■ ■ ■ should satisfy the requirement 

of cycle simplification by the bipartite property, and the contradiction is found. One 
schematic view of this process is shown in Figl 6 ](d). Thus, when there is leaf-removal 
unfrozen core, cycle simplification operation can always be performed on it, and the 
hnal resulted graph should have no leaf-removal unfrozen core. □ 

For the entropy hg = , the number of solutions does not change after the cycle 

simplihcations, the simplified unfrozen core with the removed leaves should be consider 
together, and the calculation of hg is mainly based on the Vertex-Cover Solution Number 
Counting Algorithm [15], which is a complete algorithm when the reduced solution graph 
is exact and performs well when there is no unfrozen core. In FiglTJ the entropy of 
random bipartite Vertex-Cover with equal independent sets is provided, which evolves 
similarly as that of random graphs [I 3 |; the core entropy for 1000 instances with average 
degrees c = 3, 3.5,4,4.5 are also given and most instances have very low entropy (very 
few solutions). As the average degree increases, the unfrozen core increases as that in 








Research on Solution Space of Bipartite Graph Vertex-Cover by Maximum MatchingslA 

FigJSl but the solutions in the unfrozen core are very few, which also reveals the strong 
correlations among nodes in the unfrozen core. 

4. Easy-solving Vertex-Cover instances and Konig — Egervdry subgraph 
4 . 1 . Vertex-Cover solution space of bipartite core graph 

By the results in |8] and we know that the minimal vertex-covers can be easily 
obtained when there is no leaf-removal core for a graph. And, by the analysis of the 
above sections, the solution space of the bipartite graph can be easily obtained by the 
maximum matchings. Thus, if a graph has no leaf-removal core or the leaf-removal core 
is a bipartite graph, we name it as bipartite core graph, and its solution space expres¬ 
sion/reduced solution graph can be exactly achieved using the Mutual-determination 
and Backbone Evolution Algorithm HI and the Bipartite Graph Solution Space Expres¬ 
sion Algorithm. 

Indeed, these two algorithms can be combined simply using the idea of maximum 
matching for the bipartite core graph: each leaf (a petiole with one pendant point con¬ 
necting it) can be viewed as one matching, so the maximum matching number (except 
the core) is the same as the number of leaves; for the bipartite leaf-removal core, the 
maximum matchings are easily to be obtained. By the minimal coverage, the covered 
nodes should be in these two kinds of matchings, and nodes that are not contained in 
the matchings must be uncovered backbones. Thus, for a bipartite core graph G, we can 
first hnd its maximum matchings, mark them as double edges and unmatched nodes 
uncovered backbones; then consider the freezing influence propagation caused by the 
uncovered backbones, and £x the states of some nodes like that in the Bipartite Graph 
Solution Space Expression Algorithm] at last, check whether the odd cycle breaking op¬ 
eration [H] should be performed, and do the freezing influence operation jH] for the 
new hxed backbones. Finally, the obtained compatible graph is the reduced solution 
graph A^{G), and the algorithm is given as follows. 

As the leaves of a bipartite core graph may not be bipartite, e.g., odd cycles can 
exist (that is why odd-cycle-breaking operations are needed), the bipartite core graph 
involves not only the bipartite graph and it belongs to the Konig — Egervdry graph. 
On the contrary, a Konig — Egervdry graph cannot always be a bipartite core graph, 
and the structures or characteristics of the Konig — Egervdry graph are not very clear 
till now [23]. However, if it is known that G is a Konig — Egervdry graph, its solution 
space can be determined similarly as the Bipartite Core Graph Solution Space Expres¬ 
sion Algorithm, as all the covered nodes must be involved in the maximum matchings. 
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Bipartite Core Graph Solution Space Expression Algorithm 

INPUT: Bipartite core graph G 

OUTPUT: Reduced solution graph S^{G) of G 

begin 

Find all the leaves by the leaf-removal algorithm 
^i=assign one matching in each leaf 

^2=ni3-ximum matching of bipartite leaf-removal core of G 

S^{G)=Freezing-Influence [G, 
while (Odd-Cycle-Breaking needed) 

y'{G) = Odd- Cycle-Breaking {S^{G)) 

S^{G)=Freezing-Influence {y{G)) 

end 


4-2. Generate Konig — Egervary subgraph by bipartite Vertex-Gover solution space 

By the results in [Tl[23], Konig —Egervary graph has its minimum coverage the same as 
its maximum matching number, and hnding the maximum Konig — Egervary subgraph 
of a given graph G(U, E) is an NP-complete problem. Konig — Egervary subgraph 
provides a witness of the minimum coverage having the maximum matchings’ number 
as a lower bound. However, to hnd such a maximal Konig — Egervary subgraph is 
not always easy as it is NP-complete. Here, we aim to analyze the Konig — Egervary 
subgraph using an edge-adding process, in order to see the organization of Konig — 
Egervary graph in the viewpoint of reduced solution space. In the following, one 
possible way based on our above analysis is provided to obtain a Konig — Egervary 
subgraph. 

We will start the process by a bipartite subgraph Bq, which is easy to be obtained 
from G. Based on Bq (a Konig — Egervary subgraph), we will show how to enlarge 
it to keep the Konig — Egervary property by adding new edges and using the solution 
space expression of Vertex-Cover. Certainly, any new edge between the two independent 
sets Xi and X 2 for a bipartite graph will keep the Konig — Egervary property, as it 
still leads to a bipartite subgraph. Thus, the following steps are provided as a Growth 
Process for Konig — Egervary Subgraph. 

Step 1 Start with the graph G{V,0), hnd a group of maximum matchings of G 
and add all the matching edges to G(l/, 0). 

Step 2 Divide the vertices into two different sets Xi and X 2 by which the two 
ends of any matching are in different sets, add all the edges between different sets, and 
a bipartite subgraph Gs is obtained. 

Step 3 Using the method of Bipartite Graph Solution Space Expression Algorithm, 
the solution space S^{Gb) of the obtained bipartite graph Gb can be achieved. 

Till now, all the edges between Xi and X 2 have been added, and then edges in the 
same sets will be considered. By the following steps, it is ensured that each obtained 
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Figure 8. The generating process of a Konig — Egervary subgraph for a given graph 
G. Subgraph (a) is the original graph, and (b) is its bipartite subgraph expressed 
by the reduced solution graph. Subgraphs (c-f) reveal the growth process of the 
Konig — Egervary subgraphs, which are for the cases in Steps (4-7) correspondingly. 


subgraph is a Konig — Egervary one and denoted by Gke- Evidently, before these 
steps, Gke = Gb- When a new edge G G — Gke is considered to be added to Gke-, the 
following steps work: 

Step 4 If one of its two ends is a covered backbone in S^{Gke), the edge can be 
added directly without changing the solution space of Gke- 

Step 5 If both of its two ends are uncovered backbones in ^{Gke), the edge 
cannot be added into Gke and should be discarded, as it will cause the energy increase. 

Step 6 If one of its two ends is an uncovered backbone with the other unfrozen 
in S^{Gke)) by changing the unfrozen end to covered backbone and doing a Freezing- 
Influence operation on the reduced solution graph of Gke, the edge can be added to 
Gke- 

Step 7 If both of its two ends are unfrozen in S^{Gke), the edge can be added to 
Gke and an Odd-Cycle-Breaking operation should be perform if necessary. 

Step 8 The above Steps (4-7) are repeated until no new edge in G — Gke can be 
added, and the hnal resulted Gke is obtained. 

In FigJHl an example is provided to show the process of getting a Konig —Egervary 
subgraph, which indeed provides a maximal Konig — Egervary subgraph of G. By the 
steps (1-3) and repeating steps (4-7), the obtained subgraph is a Konig — Egervary 
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one, and its coverage equals to the maximum matching number. However, the above 
process cannot always ensure a maximal Konig — Egervary subgraph of a general graph 
G. Indeed, inaccuracy arises from the sequence of adding these edges and the Step 7. 
In Step 7, as there may be many choices to do the Odd-Cycle-Breaking operation |14j . 
it may lead to a contraction of the solution space of Gke, and make wrong decisions 
on discarding more edges in the following steps. Here, the inaccuracy is a complicated 
topic for the NP-completeness of maximal Konig — Egervary subgraph, which will 
not be studied in this paper. Certainly, we can record the solution space for different 
Odd-Cycle-Breaking choices by more than one reduced solution graphs, but when facing 
many cases of Step 7, the space complexity increases exponentially, which leads to the 
NP-completeness of determining maximal Konig — Egervary subgraphs. Generally 
speaking, at least we can make the following conclusion: 

Proposition: The Growth Process for Konig — Egervary Subgraph keeps the sys¬ 
tem energy unchanged and the Konig — Egervary property for each step, and results 
in a Konig — Egervary subgraph. 


5. Conclusion and Discussion 

The solution space description for Vertex-Cover of bipartite graphs with some further 
analysis is investigated in this paper. The exact solution space expression is provided 
by the Bipartite Graph Solution Space Expression Algorithm for bipartite graphs and by 
the Bipartite Core Graph Solution Space Expression Algorithm for bipartite core graphs. 
The states of the nodes on the bipartite graphs, e.g., backbones and unfrozen nodes, 
are determined by the algorithms, and experiments are given to support the analysis of 
their statistics. Specially, as the emergence of the unfrozen core, the statistical results 
become not typical, which implicates the existence of long-range influence of some 
nodes and the replica symmetric breaking phenomenon. The entropy and clustering 
entropy are calculated separately, and some technique named cycle simplihcation is 
proposed to simplify the unfrozen core. Besides, as extended research, some analysis on 
the Konig — Egervary (sub)graph is given to detect the scope of applications for the 
proposed algorithms. 

Though the Vertex-Cover problem on bipartite graph is a P problem in 
computational complexity, clear understanding of its solution space and nodes’ 
correlations can help us a lot to recognize the core difficulty of solving Vertex-Cover on 
general graphs. Indeed, bipartite graphs can be imbedded as subgraphs to view general 
graphs and the modes of nodes’ evolution in it can be used for reference to detect the 
solution organization mechanism on general graphs. Especially, as the proposed Growth 
Process for Konig — Egervary Subgraph, the study on bipartite graph can also provide 
us a schematic view of the obstacles of achieving a maximal Konig—Egervary subgraph, 
which is another NP-complete problem. 
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